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prediction  ‘ 

Reader  Aids  — 

Purpose:  Clarify  a  problem  in  original  paper 
Special  math  needed  for  explanations:  Elementary  probability 
theory 

Special  math  needed  to  use  results:  Elementary  probability  and 
statistics 

Results  useful  to:  Failure  data  (error  log)  analysts  and  reliability 
analysts 

Abstract  —  The  original  paper  used  “traditional  statistical 
analysis’’  to  demonstrate  the  superiority  of  the  proposed  disper¬ 
sion  frame  technique.  The  purpose  was  to  distinguish  between  tran¬ 
sient  and  intermittent  errors  and  predict  the  occurrence  of  inter¬ 
mittent  errors.  Tois  note  shows  that  those  traditional  statistical 
methods  were  too  “traditional’’  since  they  involved  fitting  a 
distribution  to  data  which  were  not  identically  distributed.  Ap¬ 
propriate  statistical  techniques  for  fitting  models  to  such  non¬ 
stationary  data  are  briefly  discussed,  and  reasons  are  profTered 
for  the  persistence  of  “too  traditional’’  statistical  methods  in  the 
reliability  literature. 


1.  INTRODUCTION 

Lin  &  Siewiorek  [1],  (henceforth  LS)  presented  heuristic 
techniques  for  deciding  if  computer  system  problems  could  be 
traced  to  faulty  hardware.  LS  also  used  “traditional  statistical 
analysis  methods’’  (their  terminology)  to  distinguish  between 
intermittent  and  transient  errors.  We  show  that  these  statistical 
techniques  were  "too  traditional”,  ie,  the  "standard  operating 
procedure"  of  fitting  a  distribution  to  data  which  were  not  iden¬ 
tically  distributed  was  adppted.  This  problem  is  discussed  in 
detail  in  section  2;  the  reasons  for  the  persistence  of  the 
misconceptions  which  have  led  to  this  prevailing,  but  inap¬ 
propriate,  approach  are  briefly  discussed  in  section  3.  Section 
4  contains  some  brief  observations  concerning  the  LS  heuristic 
approach. 

Standard  notation  is  given  in  "Information  for  Readers  & 
Authors”  at  the  rear  of  each  issue. 


2.  INAPPLICABILITY  OF 
"TRADITIONAL  STATISTICS” 


Ref  [1:  Introduction)  considered  the  situation  where  inter¬ 
mittent  faults  could  be  distinguished  from  transient  faults  based 
largely  on,  “the  fact  that  intemittent  errors  reoccur,  often  at 
an  increasing  rate”.  More  specifically,  [1:  section  3.2]  states, 
"Periods  of  increasing  error  rae.  which  appear  as  either  clusters 
of  errors  or  decreasing  intennival  times  between  errors  (sug¬ 
gesting  a  Weibull  failure  distribution  with  a  >  1),  are 
observed.”  The  situation  where  successive  interarrival  times 
between  errors  are  decreasing  cannot  be  modeled  by  successive 
samples  from  the  same  Weibull  distribution,  however,  and  this 
can  be  seen  most  readily  when  the  Weibull  hazard  function  is 
increasing  most  rapidly. 

For  any  Cdf,  Fx(x)  =  Pr{XSx},  the  corresponding 
hazard  function  is: 

ZxU)  “  Fx(.x)/Fx(x) 

Fx(x)  is  the  survivor  function  (complementary  Cdf). 

For  the  Weibull  distribution. 


zx{x)  -  X-ot-  (X-jr)®-1,  and  it  — 

decreases  in  x,  for  a<  1. 
is  invariant  in  x,  for  a=l, 
increases  in  x,  for  cr>  I. 


For  a  >  >  1 ,  zx(x)  increases  very  rapidly  as  x  increases,  and 
as  a— oo,  zx(jc)  becomes  an  impulse  at  1/X  [2:  p  185).  If  we 
keep  sampling  from  this  degenerate  distribution,  successive 
failures  occur  at  1  A,  2/X,  3/X _ ;  ie,  when  the  hazard  func¬ 

tion  increases  most  rapidly,  there  is  no  tendency  whatsoever 
for  successive  interarrival  times  to  become  smaller!  For  finite 
a,  there  is  sampling  variability  which  can  result  in  a  sequence 
of  interarrival  times  which  tend  to  get  smaller  —  however,  if 
we  keep  sampling  from  a  Weibull  distribution,  with  fixed 
parameters  X  and  (finite)  a,  sampling  variability  can  also  result 
in  a  sequence  of  interarrival  times  which  tend  to  become  larger, ' 
even  if  a  >  1 .  The  basic  point  is  that  whenever  we  sample  from  rAB 
a  renewal  process  (ie,  a  nonteminating  sequence  of  i.i.d.  in-  ce<* 
terarrival  times)  the  successive  iteiarrival  times  randomly  vary  -n. — 
around  their  common  mean,  which  for  the  Weibull  distribution 
is:  _ 


°Lv 

lAJLi  * 


:ra&i 


tool 


(I/X)-r(l  +  l/a). 


i 'lability 


Of  course,  when  a— oo,  the  mean  is  1/X,  as  anticipated,  since  Aw^f 


failure  always  occurs  at  1/X. 
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misconceptions  which  have  led  to  this  prevailing,  but  inap¬ 
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Ref  [I :  Introduction]  considered  the  situation  where  inter¬ 
mittent  faults  could  be  distinguished  from  transient  faults  based 
largely  on,  "the  fact  that  intennittent  errors  reoccur,  often  at 
an  increasing  rate”.  More  specifically,  [1:  section  3.2]  states, 
“Periods  of  increasing  error  ras.  which  appear  as  either  clusters 
of  errors  or  decreasing  intenrrival  times  between  errors  (sug¬ 
gesting  a  Weibull  failure  distribution  with  ot  >  1),  are 
observed.”  The  situation  where  successive  interarrival  times 
between  errors  are  decreasing  cannot  be  modeled  by  successive 
samples  from  the  same  Weibull  distribution,  however,  and  this 
can  be  seen  most  readily  when  the  Weibull  hazard  function  is 
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For  any  Cdf,  Fx{ x)  =  ?r{X£x},  the  corresponding 
hazard  function  is: 


ZxW  *  Fx(x)/Fx(x) 

Fx( x)  is  the  survivor  function  (complementary  Cdf). 
For  the  Weibull  distribution. 


zx{x)  =  X  ot- (X-x)“-1,  and  it  — 

decreases  in  x,  for  a<  1. 

is  invariant  in  x,  for  a  =  1, 

increases  in  x,  for  ot>  1. 

For  a  >  >  1,  zx(x)  increases  very  rapidly  as  x  increases,  and 
as  a—a>,  zx(x)  becomes  an  impulse  at  1/A  [2:  p  185].  If  we 
keep  sampling  from  this  degenerate  distribution,  successive 
failures  occur  at  1/X,  2/A  3/X _ ;  ie,  when  the  hazard  func¬ 

tion  increases  most  rapidly,  there  is  no  tendency  whatsoever 
for  successive  interarrival  times  to  become  smaller!  For  finite 
a,  there  is  sampling  variability  which  can  result  in  a  sequence 
of  interarrival  times  which  tend  to  get  smaller  —  however,  if 
we  keep  sampling  from  a  Weibull  distribution,  with  fixed  r  oT 
parameters  X  and  (finite)  a,  sampling  variability  can  also  result  _  — 
in  a  sequence  of  interarrival  which  tend  to  become  larger, 
even  if  o  >  1 .  The  basic  point  is  that  whenever  we  sample  from  r*8 
a  renewal  process  ( ie ,  a  nomrainating  sequence  of  i.i.d.  in-  CM 
teramval  times)  the  successive  mterarrival  times  randomly  vary  ~n  — 
around  their  common  mean,  which  for  the  Weibull  distribution 


(1/X) -rci  +  l/a). 


'lability 


Of  course,  when  o— «•,  the  mean  is  1/X.  as  anticipated,  since  Av^f**n 
failure  always  occurs  at  1/A  i 
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analysis  methods”  (their  terminology)  to  distinguish  between 
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“TRADITIONAL  STATISTICS” 

Ref  [1:  Introduction]  considered  the  situation  where  inter¬ 
mittent  faults  could  be  distinguished  from  transient  faults  based 
largely  on,  "the  fact  that  internment  errors  reoccur,  often  at 
an  increasing  rate”.  More  specifically,  [1:  section  3.2]  states, 
“Periods  of  increasing  error  rate,  which  appear  as  either  clusters 
of  errors  or  decreasing  interanival  times  between  errors  (sug¬ 
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observed.”  The  situation  where  successive  interarrival  times 
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samples  from  the  same  Weibull  distribution,  however,  and  this 
can  be  seen  most  readily  when  the  Weibull  hazard  function  is 
increasing  most  rapidly. 

For  any  Cdf,  Fx(x)  =  Prpf^jr},  the  corresponding 
hazard  function  is: 

zx(x)  s  Fx(x)/Fx(x) 

Fx(x)  is  the  survivor  function  (complementary  Cdf). 

For  the  Weibull  distribution. 

ZxM  =  X-a- (X-x)“-1,  and  it  — 

decreases  in  x,  for  a  <  1. 
is  invariant  in  x,  for  a  - 1, 
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For  or  >  >  I ,  zx(x)  increases  very  rapidly  as  x  increases,  and 
as  a— oo,  zx(x)  becomes  an  impulse  at  1/X  [2:  p  185].  If  we 
keep  sampling  from  this  degenerate  distribution,  successive 
failures  occur  at  1/X,  2/X,  3/X. ...;  ie,  when  the  hazard  func¬ 
tion  increases  most  rapidly,  there  is  no  tendency  whatsoever 
for  successive  interarrival  times  to  become  smaller!  For  finite 
a,  there  is  sampling  variability  which  can  result  in  a  sequence 
of  interarrival  times  which  tend  to  get  smaller  —  however,  if 
we  keep  sampling  from  a  Weibull  distribution,  with  fixed 
parameters  X  and  (finite)  a,  sampling  variability  can  also  result 
in  a  sequence  of  interarrival  times  which  tend  to  become  larger, 
even  if  a  >  1 .  The  basic  point  is  that  whenever  we  sample  from 
a  renewal  process  (ie,  a  nonterminating  sequence  of  i.i.d.  in¬ 
terarrival  times)  the  successive  interarrival  times  randomly  vary 
around  their  common  mean,  winch  for  the  Weibull  distribution 
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(1/X) -r(l  +  l/a). 

Of  course,  when  a—  eo,  the  mean  is  1  /X,  as  anticipated,  since 
failure  always  occurs  at  1/X. 
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sion  frame  technique.  The  purpose  was  to  distinguish  between  tran¬ 
sient  and  intermittent  errors  and  predict  the  occurrence  of  inter¬ 
mittent  errors.  This  note  shows  that  those  traditional  statistical 
methods  were  too  “traditional”  since  they  involved  fitting  a 
distribution  to  data  which  were  not  identically  distributed.  Ap¬ 
propriate  statistical  techniques  for  fitting  models  to  such  non¬ 
stationary  data  are  briefly  discussed,  and  reasons  are  proffered 
for  the  persistence  of  “too  traditional”  statistical  methods  in  the 
reliability  literature. 


1.  INTRODUCTION 

Lin  &  Siewiorek  [1],  (henceforth  LS)  presented  heuristic 
techniques  for  deciding  if  computer  system  problems  could  be 
traced  to  faulty  hardware.  LS  also  used  “traditional  statistical 
analysis  methods”  (their  terminology)  to  distinguish  between 
intermittent  and  transient  errors.  We  show  that  these  statistical 
techniques  were  "too  traditional’ ’,  ie,  the  “standard  operating 
procedure”  of  fitting  a  distribution  to  data  which  were  not  iden¬ 
tically  distributed  was  adppted.  This  problem  is  discussed  in 
detail  in  section  2;  the  reasons  for  the  persistence  of  the 
misconceptions  which  have  led  to  this  prevailing,  but  inap¬ 
propriate,  approach  are  briefly  discussed  in  section  3.  Section 
4  contains  some  brief  observations  concerning  the  LS  heuristic 
approach. 

Standard  notation  is  given  in  “Information  for  Readers  & 
Authors”  at  the  rear  of  each  issue. 


2.  INAPPLICABILITY  OF 
“TRADITIONAL  STATISTICS” 

Ref  [1:  Introduction]  considered  the  situation  where  inter¬ 
mittent  faults  could  be  distinguished  from  transient  faults  based 
largely  on,  “the  fact  that  internment  errors  reoccur,  often  at 
an  increasing  rate”.  More  specifically,  [1:  section  3.2]  states, 
“Periods  of  increasing  error  rue.  which  appear  as  either  clusters 
of  errors  or  decreasing  interarrival  times  between  errors  (sug¬ 
gesting  a  Weibull  failure  distribution  with  a  >  1),  are 
observed.”  The  situation  when:  successive  interarrival  times 
between  errors  are  decreasing  cannot  be  modeled  by  successive 
samples  from  the  same  Weibull  distribution,  however,  and  this 
can  be  seen  most  readily  when  the  Weibull  hazard  function  is 
increasing  most  rapidly. 

For  any  Cdf,  Fx(x)  =  Pr{Y<;c},  the  corresponding 
hazard  function  is: 

*x(*)  3  Fx(x)/Fx(x) 

Fx(x)  is  the  survivor  function  (complementary  Cdf). 

For  the  Weibull  distribution. 

zx(x)  =  X-Of- (X-jt)“~1,  and  it  — 

decreases  in  x,  for  or  <  1. 
is  invariant  in  x,  for  a  =  1, 
increases  in  x,  for  a  >  1. 

For  a  >  >  1 ,  zx(x)  increases  very  rapidly  as  x  increases,  and 
as  a—  oo,  zx(x)  becomes  an  impulse  at  1/X  [2:  p  185].  If  we 
keep  sampling  from  this  degenerate  distribution,  successive 
failures  occur  at  1/X,  2/X,  3/X. ...;  ie,  when  the  hazard  func¬ 
tion  increases  most  rapidly,  there  is  no  tendency  whatsoever 
for  successive  interarrival  times  to  become  smaller!  For  finite 
a,  there  is  sampling  variability  which  can  result  in  a  sequence 
of  interarrival  times  which  tend  to  get  smaller  —  however,  if 
we  keep  sampling  from  a  Weibull  distribution,  with  fixed 
parameters  X  and  (finite)  a,  sampling  variability  can  also  result 
in  a  sequence  of  interarrival  rimes  which  tend  to  become  larger, 
even  if  a  >  1 .  The  basic  point  is  that  whenever  we  sample  from 
a  renewal  process  (ie,  a  nomerminating  sequence  of  i.i.d.  in¬ 
terarrival  times)  the  successive  interarrival  times  randomly  vary 
around  their  common  mean,  winch  for  the  Weibull  distribution 
is: 

(i/X) -r(i  +  l/a). 

Of  course,  when  a— «,  the  mean  is  1/X,  as  anticipated,  since 
failure  always  occurs  at  1/X. 
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A  renewal  process  cannot  be  used  to  model  a  sequence  of 
interarrival  times  that  tend  to  decrease,  because  an  increasing 
hazard  function  is  a  property  of  one  interarrival  time,  rather 
than  of  a  property  of  a  sequence  o/ interarrival  times.  Figure 

1  portrays  a  sequence  of  interarrival  times  Xx,  X2,  Xj . The 

figure  distinguishes  between  local  time  x,  which  is  measured 
from  the  most  recent  error  event,  and  global  time  t,  which  is 
measured  from  the  origin  for  Xlt  regardless  of  the  number  of 
error  events.  The  rate  of  occurrence  pf  failures  of  a  sequence 
of  interarrival  times  is: 

v(f),m  dE{N(t)}/dt, 

Notation 

v(r)  rate  of  occurrence  of  failures  (rocof) 

N(t)  observed  number  of  failures  in  (0,t]. 

f-«—  X  I-*— X 

H — x, — - X2 - *+• - X3 - - 

K-  °  -  — . - .  -  — t 

Figure  1.  A  Sequence  of  Interarrival  Times 


A  necessary  condition  for  modeling  a  decreasing  sequence  of 
interarrival  times  is  to  use  a  model  for  which  v(r)  increases 
in  r;  this  is  not  a  sufficient  condition  (3:  pp  41-42].  The 
nonhomogeneous  Poisson  process  (NHPP)  [3:  pp  30-33]  is 
presented  as  a  suitable  candidate  by  Thompson  [4]  and  Rigdon 
&  Basu  [6],  and  in  [3:  pp  47-52].  Under  the  special  case  of 
a  power-law  process  [3:  p  101], 


times  to  become  shorter;  unfortunately,  under  the  incorrect  in-  tag 
terpretation  of  “failure  rate”  as: 

v(r)  =  d  E{N(t)}/dt,  it  appears  that  increasing  failure  rate 
corresponds  to  an  increasing  number  of  failures  per  unit  time 
[5,6],  The  almost  universal  use  of  “failure  rate”  for  both 
hx(x)  &  v(r)  by  practitioners  &  theorists  is  the  chief  cause  of 
the  lack  of  understanding  that  there  are  two  different  bathtub 
curves: 

•  hx(x)  plotted  against  x  for  nonrepairable  items 

•  v(r)  plotted  against  r  for  repairable  items  [7]. 

2a.  The  NHPP  with  v(r)  =  \-ot- (\-f)“-1  has  been 
referred  to  as  a  power-law  process  in  these  comments.  This 
NHPP  is  widely  and  misleadingly/improperly  known  as  a 
“Weibull  process”  in  the  literature.  There  is  some  connection 
between  the  so-called  “Weibull  process”  and  a  Weibull  distribu¬ 
tion,  ie,  under  the  power-law  process,  time  to  first  failure  is 
Weibull  distributed  [3:  pp  160-161];  but  there  are  major  dif¬ 
ferences  between  these  models  as  well.  There  is  just  enough 
connection  between  the  two  models  to  make  it  especially  im¬ 
portant  to  emphasize  the  major  distinctions  between  them  [3,6]. 
These  distinctions  are  blurred  by  the  misleading/improper  term, 
“Weibull  process”. 

2b.  Ref  [1 :  section  3.4]  used  “Weibull  process”  in  a  very 
different  sense,  viz,  a  “Weibull  process”  is  a  renewal  process 
with  Weibull  distributed  interarrival  times.  Since  this  is  another 
“natural”  interpretation  of  “Weibull  process”,  it  provides 
another  important  reason  for  not  using  “Weibull  process”  as 
a  synonym  for  power  law  NHPP.  □ 


I 

! 

i 


i 


v(/)  =  A-aMX-/)*-1; 

successive  interarrival  times  tend  to  decrease  for  a  >  1 ,  since 
the  rate  of  occurrence  of  failures  is  increasing.  As  explained 
in  section  3,  in  spite  of  superficial  similarities  between  this 
NHPP  and  the  Weibull  distribution,  there  are  crucial  differences 
between  these  models. 


3.  REASONS  FOR  MISCONCEPTIONS 

Ref  3  devotes  20  pages  [3:  pp  133-152]  to  tabulating  & 
describing  chronic  misconceptions  about  repairable  systems,  and 
devotes  17  pages  [3:  pp  152-168]  to  reasons  for  these  misconcep¬ 
tions.  Instead  of  trying  to  summarize  this  material  here,  only 
the  two  chief  causes  tof  the  widespread  misconception  (that  a 
sequence  of  successively  shorter  interarrival  times  can  be 
modeled  by  Weibull  distributions  with  cr  >  1)  are  presented. 

1.  The  term  “failure  rate”  is  almost  always  defined  as 
hx(x)  but,  as  emphasized  by  Thompson  [4],  "failure  rate” 
then  is  “naturally”,  and  erroneously,  interpreted  as  v(r). 1  As 
explained  in  section  2,  under  the  definition  of  "failure  rate” 
as  hx(x),  there  is  no  connection,  in  general,  between  increas¬ 
ing  “failure  rate”  and  a  tendency  for  successive  interarrival 


In  addition  to  problems  engendered  by  the  terms,  “failure 
rate"  and  “Weibull  process”,’  there  are  several  subtleties  en¬ 
countered  when  distinguishing  between  the  analysis  of  times 
to  failure  of  nonrepairable  items  and  the  analysis  of  the  interar¬ 
rival  times  of  a  repairable  system  [3:  pp  32-33,  pp  51-52].  As 
emphasized  in  [8],  reliability-oriented  mathematical  statisticians 
have  almost  ignored  the  discussion  of  these  distinctions  in  their 
papers  &  books  and  have  seldom  even  outlined  appropriate 
techniques  for  repairable  systems.  As  pointed  out  by  Newton 
[9],  for  example,  "...  it  is  essential  that  sequencing  is  taken 
into  account.  It  seems  remarkable  that  so  little  attention  has  been 
given  to  this  major  potential  pitfall.  Ascher  &  Feingold  [3]  com¬ 
ment  on  the  fact  that  among  the  hundreds  of  textbooks  on 
reliability,  only  two  (plus  their  own!)  make  any  reference  to 
the  need  to  take  sequencing  into  account.”  Moreover,  even 
when  repairable  systems  concepts  &  techniques  are  considered, 
the  treatment  is  often  very  confusing:  Basu  &  Rigdon  [10] 
observed,  "Much  of  what  is  wrinen  on  repairable  systems  con¬ 
tains  serious  misconceptions  and  poor  terminology.  As  Ascher 
&  Feingold  [3:  p  133]  state,  *...  the  prevalent  terminology  could 


'in  fact.  Thompson  defines  failure  rare  is  v(r)  since,  u  he  stresses,  (hat  is 
the  way  engineers  interpret  the  term  regardless  of  how  it  is  defined.  Ref  [3] 
stresses  that  statisticians  also  often  have  alien  into  this  semantic  trap,  which 
they  have  set  for  engineers  —  and  for  themselves! 
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scarcely  be  more  misleading  if  it  had  been  designed  to  mislead- 
specifically,  it  has  engendered  such  deep-seated  misconceptions 
that  it  is  extraordinarily  difficult  to  supplant  it  with  improved 
nomenclature’  It  is  not  surprising,  therefore,  that  statistical 
practitioners  have  been  led  astray  by  “traditional  statistical 
analysis  methods”! 


4.  COMMENTS  ON  THE  DISPERSION 
FRAME  TECHNIQUE 

Ref  [1:  section  4]  described  the  Dispersion  Frame  Tech¬ 
nique  (DFT)  and  illustrated  its  application  to  several  types  of 
computer  hardware.  The  five  heuristic  DFT  ruies  are  designed 
to  detect  clusters  of  errors  associated  with  a  specific  hardware 
problem  against  the  background  “noise”  of  randomly  occurring 
transient  errors.  From  the  LS  results,  the  DFT  usually  was  suc¬ 
cessful  in  isolating  such  clusters,  which  associated  with  specific 
hardware  problems.  We  have  a  few  additional  comments  on 
mathematical  analysis  of  the  luT  _H  hope  that  they  will 
stimulate  further  research. 

1 .  The  homogeneous  Poisson  process  (HPP)  is  the  model 
for  maximum  randomness  of  events  occurring  over  time.  As 
emphasized  by/  inlar  [11,  p  80],  however,  the  HPP  appears 
(to  the  naive  eye)  to  be  clustered  because  the  interarrival  times 
are  exponentially  distributed;  ie,  there  are  many  more  short  in¬ 
terarrival  times  than  long  ones.  It  would  be  useful,  therefore, 
to  use  formal  methods  for  distinguishing  between  the  HPP  and 
true  clustering.  Lewis  [12,  13]  provide  techniques  for 
distinguishing  between  an  HPP  and  true  clustering,  and  an 
NHPP  vr  an  NHPP  with  additional  clustering,  respectively. 
Lewis  [12]  specifically  applied  his  techniques  to  computer  hard¬ 
ware  problems;  Jenkins  [14]  summarized  the  practical  implica¬ 
tions  of  those  results. 

2.  Ref  [1:  section  4  (intro)]  claimed,  “These  five  [heuristic 
DFT]  rules  have  been  shown  to  mathematically  cover  a  range 
of  values  for  a,  the  Weibull  shape  parameter  observed  during 
the  data  analysis  in  section  3.”  LS  did  not  provide  such  a 
mathematically  oriented  connection  between  the  DFT  rules  and 
appropriate  values  of  a,  but  this  misconception  should  not  have 
been  put  forth  in  the  first  place.  That  is  (see  section  2  above), 
LS  inappropriately  estimated  the  shape  parameter  of  a  Weibull 
distribution,  rather  than  the  shape  parameter  of  a  power  law 
NHPP.  If  the  NHPP  is  the  appropriate  model,  then  the  estimate 
of  the  “Weibull  distribution  shape  parameter”  is  meaningless. 
When  interarrival  times  are  not  identically  distributed,  it  is 
meaningless  to  estimate  tlie  nonexistent  parameters  of  a  nonex¬ 
istent  common  distribution  of  interarrival  times.  Therefore,  the 
conclusion  stated  at  the  $nd  of  [1:  section  3.1]  that,  “a  >  1 
is  an  oversimplification  for  intermittent  faults”  is  unwarranted. 
Correspondingly,  the  conclusion  reached  near  the  end  of  [1: 
section  3.2]  that,  "simply  looking  for  a  greater  that  1  is  insuf¬ 
ficient  to  identify  the  trend  of  an  intermittent  fault”  is  equally 
inappropriate.  Moreover,  all  statements  in  the  paper  that  are 
based  on  fitting  Weibull  distributions  to  interarrival  times  are 
questionable  since  the  fitting  of  a  distribution  to  data  is  ap¬ 


propriate  only  when  there  is  no  evidence  that  data  are  not  iden¬ 
tically  distributed.  □ 

In  summary,  the  heuristic  ground  rules  for  the  DFT 
established  in  [1 :  section  4]  provide  appropriate  guidelines  for 
distinguishing  intermittent  problems  from  transient  "glitches”. 
The  results  of  [1:  section  3]  however,  were  based  on  "too  tradi¬ 
tional”  statistical  methods  and  will  be  revised  in  a  furore  paper. 
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II?;  ~  rllo  sup  f  [<K(bx)  -  <y  (bx)]dF(x) 
Oafcsl  Jo 

+  ..*g, 1 1  Kf)  ■  ♦■(f)] dFW 

*  c,  h;  -  r  ||S  +  c2« 

C,  and  Ci  are  constants  (independent  of  n). 

Since  Htfr*  -  <t>‘  ||o  —  0  a.s.  as  rt— oo  and  t  is  arbitrary  small, 
it  follows  that  ||  -  T  llo  “  0  as-  as  n~ 00 •  Hence,  from 

[17:  corollary  1,  p  48]  £,  converges  weakly  to  f.  The 
covariance  kernel  of  f  can  be  obtained  by  letting  'i'(u)  =  u 
in  cr2('lr)  in  theorem  1.  □ 
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